Machine learning models for lung cancer classification using array comparative genomic hybridization
Abstract
Array comparative genomic hybridization (array CGH) is a recently introduced technology that measures changes in the copy number of hundreds of genes in a single experiment. The primary goal of this study was to develop machine learning models that classify non-small cell lung cancers by histopathological type and to compare several machine learning methods on this task. DNA was extracted from the tumors of 37 patients (21 squamous carcinomas and 16 adenocarcinomas) and hybridized onto a 452 BAC clone array. The following algorithms were used: k-nearest neighbors (KNN), decision tree induction, support vector machines, and feed-forward neural networks. Performance was measured by leave-one-out classification accuracy. The best multi-gene model had a leave-one-out accuracy of 89.2%. Decision trees performed worse than the other methods on this task and dataset. We conclude that gene copy numbers as measured by array CGH are, collectively, an excellent indicator of histological subtype. Several interesting research directions are discussed.
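A minimal sketch of the leave-one-out evaluation mentioned in the abstract, using a plain k-nearest-neighbor classifier. The data here are synthetic, and the five-feature dimension, the choice of k = 3, and the helper names (`knn_predict`, `leave_one_out_accuracy`, `fake_sample`) are assumptions for illustration only; they are not the study's data, features, or settings.

```python
import math
import random


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training samples nearest to query (Euclidean)."""
    neighbors = sorted((math.dist(x, query), y) for x, y in zip(train_x, train_y))
    votes = {}
    for _, label in neighbors[:k]:
        votes[label] = votes.get(label, 0) + 1
    # Iterate labels in sorted order so ties resolve deterministically.
    return max(sorted(votes), key=votes.get)


def leave_one_out_accuracy(xs, ys, k=3):
    """Hold out each sample once, train on the rest, and count correct predictions."""
    correct = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        if knn_predict(train_x, train_y, xs[i], k) == ys[i]:
            correct += 1
    return correct / len(xs)


# Synthetic stand-in for copy-number ratios (hypothetical values, not real data).
rng = random.Random(0)


def fake_sample(shift, n_features=5):
    return [rng.gauss(shift, 0.3) for _ in range(n_features)]


# Same class sizes as in the abstract: 21 squamous carcinomas, 16 adenocarcinomas.
xs = [fake_sample(0.5) for _ in range(21)] + [fake_sample(-0.5) for _ in range(16)]
ys = ["SqCa"] * 21 + ["AdCa"] * 16

acc = leave_one_out_accuracy(xs, ys, k=3)
print(f"Leave-one-out accuracy on synthetic data: {acc:.3f}")
```

With 37 samples, each held-out tumor contributes 1/37 to the accuracy, so the reported 89.2% is consistent with 33 of 37 tumors classified correctly (33/37 ≈ 0.892).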
Similar resources
Supervised Classification of Array CGH Data with HMM-Based Feature Selection
MOTIVATION: For different tumour types, extended knowledge about the molecular mechanisms involved in tumorigenesis is lacking. Looking for copy number variations (CNV) by Comparative Genomic Hybridization (CGH) can, however, help to determine key elements in this tumorigenesis. As genome-wide array CGH gives the opportunity to evaluate CNV at high resolution, this leads to a huge amount of data, ne...
Molecular Dissection Using Array Comparative Genomic Hybridization and Clinical Evaluation of An Infertile Male Carrier of An Unbalanced Y;21 Translocation: A Case Report and Review of The Literature
Chromosomal defects are relatively frequent in infertile men; however, translocations between the Y chromosome and autosomes are rare, and fewer than 40 cases of Y-autosome translocation have been reported. In particular, only three individuals have been described with a Y;21 translocation up to now. We report on an additional case of an infertile man in whom a Y;21 translocation was associated wi...
Genomic copy number analysis of non-small cell lung cancer using array comparative genomic hybridization: implications of the phosphatidylinositol 3-kinase pathway.
Genomic abnormalities at 348 loci encoding genes that may contribute to lung cancer transformation and progression were assessed using array comparative genomic hybridization in 21 squamous carcinomas (SqCas) and 16 adenocarcinomas (AdCas). Hierarchical clustering showed a clear pattern of gains and losses for the SqCas, whereas the pattern for AdCas was less distinct. Cross-validated classific...
Genetic classification of lung adenocarcinoma based on array-based comparative genomic hybridization analysis: its association with clinicopathologic features.
Array-based comparative genomic hybridization using microarrayed bacterial artificial chromosome clones allows high-resolution analysis of genome-wide copy number changes in tumors. To analyze the genetic alterations of primary lung adenocarcinoma in a high-throughput manner, we used laser-capture microdissection of cancer cells and array comparative genomic hybridization focusing on 800 chrom...
Journal title:
- Proceedings. AMIA Symposium
Publication date: 2002